SEQUENCE LISTING 



<110> Evans, Ronald 

Blumberg, Bruce 

<120> NOVEL STEROID -ACTIVATED NUCLEAR 
RECEPTORS AND USES THEREFOR 

<130> SALK2270-2 

<140> 09/458,366 
<141> 1999-12-09 

<150> 09/005,286 
<151> 1998-01-09 

<160> 39 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 2068 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (583) . . . (1884) 

<223> N is selected from A, C, T/U or G 



<400> 1 

ggcacgagga 

cagactgatg 

ccttttcctg 

agaacaaata 

gggcaatgga 

gattgtctca 

ctttacttca 

caagagaata 

ccttgatcga 

aagaacttac 



gatctaggtt 
aaatgcgctc 
tgtttttgta 
cggtaatctc 
gccgcttagt 
tatccgggga 
gtgggaatct 
agctaatact 
tcctttgcac 
caccaagcag 



caaattaatg 
agaattactt 
gtgaagagac 
ttcatttgct 
gcctacatct 
aatcataacc 
cggcctcagc 
cctgtcctga 
cggattgttc 
tccaagaggc 



ttgcccctag 
agacaaagcg 
ctgaaagaaa 
agttcaagtg 
gacttggact 
tatgactagg 
ctgcaagcca 
acaaggcagc 
aaagtggacc 
ccagaagcaa 



tggtaaagga 
gatatttgcc 
aaagtaggga 
ctggacttgg 
gaaatatagg 
acgggaagag 
agtgttcaca 
ggctccttgg 
ccaggggaga 
ac ctg gag 
Leu Glu 
1 



cagagaccct 

actctcttcc 

gaacataatg 

gacttaggag 

tgagagacaa 

gaagcactgc 

gtgagaaaag 

taaagctact 

agtcggagca 

gtg aga 

Val Arg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
594 



ccc aaa gaa age tgg aac cat get gac ttt gta cac tgt gag gac aca 
Pro Lys Glu Ser Trp Asn His Ala Asp Phe Val His Cys Glu Asp Thr 
5 10 15 20 



642 



gag tct gtt cct gga aag ccc agt gtc aac gca gat gag gaa gtc gga 
Glu Ser Val Pro Gly Lys Pro Ser Val Asn Ala Asp Glu Glu Val Gly 
25 30 35 



690 



ggt ccc caa ate tgc cgt gta tgt ggg gac aag gec act ggc tat cac 
Gly Pro Gin lie Cys Arg Val Cys Gly Asp Lys Ala Thr Gly Tyr His 
40 45 50 



738 



ttc aat gtc atg aca tgt gaa gga tgc aag ggc ttt ttc agg agg gec 
Phe Asn Val Met Thr Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Ala 



786 




atg aaa cgc aac gcc. egg ctg agg tgc ccc ttc egg aag ggc gec tgc 834 
Met Lys Arg Asn Ala Arg Leu Arg Cys Pro Phe Arg Lys Gly Ala Cys 
70 75 80 

gag ate ace egg aag ace egg cga cag tgc cag gcc tgc cgc ctg cgc 882 
Glu lie Thr Arg Lys Thr Arg Arg Gin Cys Gin Ala Cys Arg Leu Arg 
85 90 95 100 

aag tgc ctg gag age ggc atg aag aag gag atg ate atg tec gac gag 93 0 

Lys Cys Leu Glu Ser Gly Met Lys Lys Glu Met lie Met Ser Asp Glu 
105 110 115 

gcc gtg gag gag agg egg gcc ttg ate aag egg aag aaa agt gaa egg 97 8 

Ala Val Glu Glu Arg Arg Ala Leu He Lys Arg Lys Lys Ser Glu Arg 
120 125 130 

aca ggg act cag cca ctg gga gtg cag ggg ctg aca gag gag cag egg 1026 
Thr Gly Thr Gin Pro Leu Gly Val Gin Gly Leu Thr Glu Glu Gin Arg 
135 140 145 

atg atg ate agg gag ctg atg gac get cag atg aaa ace ttt gac act 1074 
Met Met He Arg Glu Leu Met Asp Ala Gin Met Lys Thr Phe Asp Thr 
150 155 160 

ace ttc tec cat ttc aag aat ttc egg ctg cca ggg gtg ctt age agt 1122 
Thr Phe Ser His Phe Lys Asn Phe Arg Leu Pro Gly Val Leu Ser Ser 
165 170 175 180 

ggc tgc gag ttg cca gag cct ctg cag gcc cca teg agg gaa gaa get 1170 
Gly Cys Glu Leu Pro Glu Pro Leu Gin Ala Pro Ser Arg Glu Glu Ala 
185 190 195 

gcc aag tgg age cag gtc egg aaa gat ctg tgc tct ttg aag* gtc tct 1218 
Ala Lys Trp Ser Gin Val Arg Lys Asp Leu Cys Ser Leu Lys Val Ser 
200 205 210 

ctg caa get gcg ggg gga gga tgg cag tgt ctg gaa eta caa acn ccc 1266 
Leu Gin Ala Ala Gly Gly Gly Trp Gin Cys Leu Glu Leu Gin Xaa Pro 
215 220 225 

age cga cag tgg egg aaa gag ate ttc tec ctg ctg ccc cac atg get 1314 
Ser Arg Gin Trp Arg Lys Glu He Phe Ser Leu Leu Pro His Met Ala 
230 235 240 

gac atg tea acc tac atg ttc aaa ggc ate ate age ttt gcc aaa gtc 1362 
Asp Met Ser Thr Tyr Met Phe Lys Gly He He Ser Phe Ala Lys Val 
245 250 255 260 

ate tec tac ttc agg gac ttg ccc ate gag gac cag ate tec ctg ctg 1410 
He Ser Tyr Phe Arg Asp Leu Pro He Glu Asp Gin He Ser Leu Leu 
265 270 275 

aag ggg gcc get ttc gag ctg tgt caa ctg aga ttc. aac aca gtg ttc 1458 
Lys Gly Ala Ala Phe Glu Leu Cys Gin Leu Arg Phe Asn Thr Val Phe 
280 285 290 

aac gcg gag act gga acc tgg gag tgt ggc egg ctg tec tac tgc ttg 1506 
Asn Ala Glu Thr Gly Thr Trp Glu Cys Gly Arg Leu Ser Tyr Cys Leu 



295 300 305 

gaa gac act gca ggt ggc ttc cag caa ctt eta ctg gag ccc atg ctg 1554 
Glu Asp Thr Ala Gly Gly Phe Gin Gin Leu Leu Leu Glu Pro Met Leu 
310 315 320 

aaa ttc cac tac atg ctg aag aag ctg cag ctg cat gag gag gag tat 1602 
Lys Phe His Tyr Met Leu Lys Lys Leu Gin Leu His Glu Glu Glu Tyr 
325 330 335 340 

gtg ctg atg cag gec ate tec etc ttc tec cca gac cgc cca ggt gtg 1650 
Val Leu Met Gin Ala lie Ser Leu Phe Ser Pro Asp Arg Pro Gly Val 
345 350 355 

ctg cag cac cgc gtg gtg gac cag ctg cag gag caa ttc gee att act 1698 
Leu Gin His Arg Val Val Asp Gin Leu Gin Glu Gin Phe Ala lie Thr 
360 365 370 

ctg aag tec tac att gaa tgc aat egg ccc cag cct get cat agg ttc 1746 
Leu Lys Ser Tyr lie Glu Cys Asn Arg Pro Gin Pro Ala His Arg Phe 
375 380 385 

ttg ttc ctg aag ate atg get atg etc acc gag etc cgc age ate aat 1794 
Leu Phe Leu Lys lie Met Ala Met Leu Thr Glu Leu Arg Ser lie Asn 
390 395 400 

get cag cac acc cag egg ctg ctg cgc ate cag gac ata cac ccc ttt 1842 
Ala Gin His Thr Gin Arg Leu Leu Arg lie Gin Asp lie His Pro Phe 
405 410 415 420 

get acg ccc etc atg cag gag ttg ttc ggc ate aca ggt age 1884 
Ala Thr Pro Leu Met Gin Glu Leu Phe Gly lie Thr Gly Ser 
425 430 

tgagcggctg ccttgggtga caccttcgag aggcagecag acccagagcc ctctgagccg 1944 

gcactcccgg gecaagacag atggacactg ccaagagccg acaatgccct gctggcctgt 2004 

ctccctaggg aattcctget atgacagctg gctagcattc ctcaggaagg acatggggtg 2064 

cccc 2068 

<210> 2 
<211> 434 
<212> PRT 

<213> Homo sapiens 
<220> 

<22 3> Xaa is threonine 
<400> 2 

Leu Glu Val Arg Pro Lys Glu Ser Trp Asn His Ala Asp Phe Val His 

15 10 15 

Cys Glu Asp Thr Glu Ser Val Pro Gly Lys Pro Ser Val Asn Ala Asp 

20 25 30 

Glu Glu Val Gly Gly Pro Gin lie Cys Arg Val Cys Gly Asp Lys Ala 

35 40 45 

Thr Gly Tyr His Phe Asn Val Met Thr Cys Glu Gly Cys Lys Gly Phe 

50 55 60 

Phe Arg Arg Ala Met Lys Arg Asn Ala Arg Leu Arg Cys Pro Phe Arg 
65 70 75 80 

Lys Gly Ala Cys Glu lie Thr Arg Lys Thr Arg Arg Gin Cys Gin Ala 
85 90 95 



4 



Cys 


Arg 


Leu 


Arg 
100 


Lys 


Cys 


Leu 


Glu 


Ser 
105 


Gly 


Met 


Lys 


Lys 


Glu 
110 


Met 


He 


Met 


Ser 


Asp 
115 


Glu 


Ala 


Val 


Glu 


Glu 
120 


Arg 


Arg 


Ala 


Leu 


He 
125 


Lys 


Arg 


Lys 


Lys 


Ser 
130 


Glu 


Arg 


Thr 


Gly 


Thr 
135 


Gin 


Pro 


Leu 


Gly 


Val 
140 


Gin 


Gly 


Leu 


Thr 


Glu 


Glu 


Gin 


Arg 


Met 


Met 


He 


Arg 


Glu 


Leu 


Met 


Asp 


Ala 


Gin 


Met 


Lys 


145 










150 










155 










160 


Thr 


Phe 


Asp 


Thr 


Thr 
165 


Phe 


Ser 


His 


Phe 


Lys 
170 


Asn 


Phe 


Arg 


Leu 


Pro 
175 


Gly 


Val 


Leu 


Ser 


Ser 
180 


Gly 


Cys 


Glu 


Leu 


Pro 
185 


Glu 


Pro 


Leu 


Gin 


Ala 
190 


Pro 


Ser 


Arg 


Glu 


Glu 
195 


Ala 


Ala 


Lys 


Trp 


Ser 
200 


Gin 


Val 


Arg 


Lys 


Asp 
205 


Leu 


Cys 


Ser 


Leu 


Lys 
210 


Val 


Ser 


Leu 


Gin 


Ala 
215 


Ala 


Gly 


Gly 


Gly 


Trp 
220 


Gin 


Cys 


Leu 


Glu 


Leu 


Gin 


Xaa 


Pro 


Ser 


Arg 


Gin 


Trp 


Arg 


Lys 


Glu 


He 


Phe 


Ser 


Leu 


Leu 


225 










230 










235 










240 


Pro 


His 


Met 


Ala 


Asp 
245 


Met 


Ser 


Thr 


Tyr 


Met 
250 


Phe 


Lys 


Gly 


He 


He 
255 


Ser 


Phe 


Ala 


Lys 


Val 
260 


He 


Ser 


Tyr 


Phe 


Arg 
265 


Asp 


Leu 


Pro 


He 


Glu 
270 


Asp 


Gin 


lie 


Ser 


Leu 
275 


Leu 


Lys 


Gly 


Ala 


Ala 
280 


Phe 


Glu 


Leu 


Cys 


Gin 
285 


Leu 


Arg 


Phe 


Asn 


Thr 
290 


Val 


Phe 


Asn 


Ala 


Glu 
295 


Thr 


Gly 


Thr 


Trp 


Glu 
300 


Cys 


Gly 


Arg 


Leu 


Ser 


Tyr 


Cys 


Leu 


Glu 


Asp 


Thr 


Ala 


Gly 


Gly 


Phe 


Gin 


Gin 


Leu 


Leu 


Leu 


305 










310 










315 










320 


Glu 


Pro 


Met 


Leu 


Lys 
325 


Phe 


His 


Tyr 


Met 


Leu 
330 


Lys 


Lys 


Leu 


Gin 


Leu 
335 


His 


Glu 


Glu 


Glu 


Tyr 
340 


Val 


Leu 


Met 


Gin 


Ala 
345 


He 


Ser 


Leu 


Phe 


Ser 
350 


Pro 


Asp 


Arg 


Pro 


Gly 
355 


Val 


Leu 


Gin 


His 


Arg 
360 


Val 


Val 


Asp 


Gin 


Leu 
365 


Gin 


Glu 


Gin 


Phe 


Ala 
370 


He 


Thr 


Leu 


Lys 


Ser 
375 


Tyr 


He 


Glu 


Cys 


Asn 
380 


Arg 


Pro 


Gin 


Pro 


Ala 


His 


Arg 


Phe 


Leu 


Phe 


Leu 


Lys 


He 


Met 


Ala 


Met 


Leu 


Thr 


Glu 


Leu 


385 










390 










395 










400 


Arg 


Ser 


He 


Asn 


Ala 
405 


Gin 


His 


Thr 


Gin 


Arg 
410 


Leu 


Leu 


Arg 


He 


Gin 
415 


Asp 


lie 


His 


Pro 


Phe 
420 


Ala 


Thr 


Pro 


Leu 


Met 
425 


Gin 


Glu 


Leu 


Phe 


Gly 
430 


He 


Thr 



Gly Ser 



<210> 3 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase , rCYP3Al 

<400> 3 

tagacagttc atgaagttca tctac 

<210> 4 
<211> 25 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> putative SXR response element from the steroid 
hydoxylase, rCYP3A2 

<400> 4 

taagcagttc ataaagttca tctac 

<210> 5 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase , rUGTlA6 

<400> 5 

actgtagttc ataaagttca catgg 

<210> 6 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase , rbCYP2Cl 

<400> 6 

caatcagttc aacagggttc accaat 

<210> 7 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase, rP45 0R 

<400> 7 

cacaggtgag ctgaggccag cagcaggtcg aaa 

<210> 8 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase , rCYP2Al 

<400> 8 

gtgcaggttc aactggaggt caacatg 

<210> 9 
<211> 27 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> putative SXR response element from the steroid 
hydoxylase , rCYP2A2 

<400> 9 

gtgctggttc aactggaggt cagtatg 

<210> 10 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hy doxy 1 a s e , r C YP 2 C 6 

<400> 10 

agtctagttc agtgggggtt cagtctt 

<210> 11 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxy 1 a s e , hC YP2 E 1 

<400> 11 

gagatggttc aaggaagggt cattaac 

<210> 12 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 0 nucleotides 
<400> 12 

catagtcagg tcaaggtcag atcaac 

<210> 13 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 1 nucleotides 
<400> 13 

catagtcagg tcataggtca gatcaac 

<210> 14 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> direct repeat with spacer of 2 nucleotides 
<400> 14 

catagtcagg tcaataggtc agatcaac 

<210> 15 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 3 nucleotides 
<400> 15 

catagtcagg tcatataggt cagatcaac 

<210> 16 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 4 nucleotides 
<400> 16 

catagtcagg tcatataagg tcagatcaac 

<210> 17 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 5 nucleotides 
<400> 17 

catagtcagg tcatatatag gtcagatcaa c 

<210> 18 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 6 nucleotides 
<400> 18 

catagtcagg tcatatataa ggtcaagatc aac 

<210> 19 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 7 nucleotides 
<400> 19 

catagtcagg tcatatatat aggtcagatc aac 



8 

<210> 20 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 10 nucleotides 
<400> 20 

catagtcagg tcatatatat ataaggtcag atcaac 

<210> 21 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 15 nucleotides 
<400> 21 

catagtcagg tcatagtagt agtagtagag gtcagatcaa c 

<210> 22 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> repeat_unit 
<222> (7) . . . (7) 

<223> N is a nucleotide spacer of 3 , 4 or 5 nucleotides, 
wherein each N is independently selected from A, 
T, C or G 

<223> example of a response element suitable for 
practice of the invention method 

<400> 22 
agttcantga act 

<210> 23 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> repeat_uhit 
<222> (7) . . . (7) 

<223> N is a nucleotide spacer of 6 nucleotides, wherein 
each N is independently selected from A, T, C or G 

<223> example of a response element suitable for 
practice of the invention method 

<400> 23 
tgaactnagg tea 

<210> 24 
<211> 18 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat with 6 nucleotide spacer found in 
CYP3A4 

<400> 24 

tgaactcaaa ggaggtca 18 

<210> 25 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat response element with spacer of 0 
nucleotides 

<400> 25 

agcttaggtc atgaccta 18 

<210> 26 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat response element with spacer of 1 
nucleotides 

<400> 26 

agcttaggtc agtgaccta 19 

<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat response element with spacer of 2 
nucleotides 

<400> 27 

agcttaggtc acgtgaccta 2 0 

<210> 28 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> inverted repeat response element with spacer of 3 
nucleotides 

<400> 28 

agcttaggtc acagtgacct a 21 



<210> 29 
<211> 22 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> inverted repeat response element with spacer of 
nucleotides 

<400> 29 

agcttaggtc acatgtgacc ta 

<210> 30 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat response element with spacer of 
nucleotides 

<400> 30 

agcttaggtc acactgtgac eta 

<210> 31 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat response element with spacer of 
nucleotides 

<400> 31 

agctttgaac tcaaaggagg tea 

<210> 32 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> IR-M 
<400> 32 

agettaegtc atgacgta 

<210> 33 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> CYP3A oligonucleotide, CYP3A4 , tested for bindii 

<400> 33 

tagaatatga actcaaagga ggtcagtgag tgg 

<210> 34 

<211> 33 

<212> DNA 

<213> Artificial Sequence 



11 

<220> 

<223> CYP3A oligonucleotide, CYP3A5 , tested for binding 
<400> 34 

tagaatatga actcaaagga ggtaagcaaa ggg 3 3 

<210> 35 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> CYP3A oligonucleotide, CYP3A7 , tested for binding 
<400> 35 

tagaatatta actcaatgga ggcagtgagt gg 32 

<210> 36 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for PCR 
<400> 36 

gagcaattcg ccattactct gaagt 25 

<210> 37 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for PCR 
<400> 37 

gtccttgggg tcttctacct ttctc 25 

<210> 38 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for PCR 
<400> 38 

gacgatttgg atctggacat gttgg 25 

<210> 39 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for PCR 
<400> 39 



J* 



gttttcatct gagcgtccat cagct 



